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MODIFIED LOG SOBOLEV INEQUALITIES 
FOR CONVEX FUNCTIONS ON THE REAL LINE. 
SUFFICIENT CONDITIONS 

RADOSLAW ADAMCZAK AND MICHAL STRZELECKI 


Abstract. We provide a mild sufficient condition for a probability measure 
on the real line to satisfy a modified log-Sobolev inequality for convex func¬ 
tions, interpolating between the classical log-Sobolev inequality and a Bobkov- 
Ledoux type inequality. As a consequence we obtain dimension-free two-level 
concentration results for convex function of independent random variables with 
sufficiently regular tail decay. 

We also provide a link between modified log-Sobolev inequalities for convex 
functions and weak transport-entropy inequalities, complementing recent work 
by Gozlan, Roberto, Samson, and Tetali. 


1. Introduction and main results 

1.1. Background. Over the past few decades the concentration of measure phe¬ 
nomenon has become one of the main themes of high dimensional probability and 
geometric analysis, with applications to limit theorems, non-asymptotic confidence 
bounds or random constructions of geometric objects with extremal properties. 
While initial results on concentration of measure concerned mostly deviation bounds 
for Lipschitz functions of highly regular random variables, the seminal work by 
Talagrand [22] has revealed that if one restricts attention to convex Lipschitz func¬ 
tions, dimension-free concentration of measure holds under much weaker conditions, 
e.g. for all product probability measures with bounded support. Talagrand’s ap¬ 
proach relied on his celebrated convex distance inequality related to the analysis of 
isoperimetric problems for product measures. In subsequent papers other authors 
adapted tools from the classical concentration of measure theory, such as Poincare 
and log-Sobolev inequalities or transportation and infimum-convolution inequalities 
[nisiiiiiiiiiiiiiniii], for proving tail inequalities for convex functions. More re¬ 
cent work nnmi] presents deep connections between various proposed approaches. 

Talagrand’s inequality provides a subgaussian tail estimate for convex Lipschitz 
functions of independent uniformly bounded random variables (with dimension- 
independent constants). It is natural to ask whether such a result can hold under 
weaker assumptions on the tails of underlying random variables. One can easily see 
that it is not enough to assume a uniform subgaussian tail decay and some stronger 
regularity is necessary inn]- In [^ Bobkov and Gotze show that for a probabil¬ 
ity measure on the real line dimension-free concentration for convex functions is 
equivalent to the convex Poincare inequality and further to a certain regularity of 
the tail (see also [10] for a refinement of this result in the setting of abstract metric 
spaces). Moreover the concentration rate in this case is at least exponential. In [T] 
a related assumption on the tail is shown to imply the convex log-Sobolev inequality 
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which allows to obtain dimension-free deviation bounds on the upper tail of convex 
functions. 

The goal of this article is to show that tail conditions interpolating between 
those considered in and [1] lead to modified log-Sobolev inequalities for convex 
functions (convex counterparts of inequalities introduced by Bobkov-Ledoux [3] and 
Gentil-Guillin-Miclo Eli) and thus also two-level deviation bounds interpolating 
between the subexponential and subgaussian tail decay, in the spirit of Talagrand’s 
inequalities for the exponential or Weibull distribution EIIIIH]- Furthermore we 
demonstrate a connection of modified log-Sobolev inequalities to weak transporta¬ 
tion inequalities. In the case of the classical log-Sobolev inequality such a link has 
been recently discovered by Gozlan, Roberto, Samson, and Tetali m- 

1.2. Main results. To formulate our results we need to introduce the following 

Definition 1.1. For /3 € [0,1], m > 0 and cr > 0 let A4/3(m, denote the class 

of probability distributions /r on R for which 

Vx>m oo)) < oo)), 

i^“((-oo,a;]) < (T^+V((-oo,x]), 

where is the measure on [to, oo) with density ^{[x, c»)) and h'~ is the measure 
on (— 00 ,—to] with density |a;|^/r((—oo, a;]). 

As one can easily see, the condition defining the class Mp{m, is equivalent 

to some type of regular tail decay. More precisely, we have the following proposition, 
the proof of which is deferred to Section [31 

Proposition 1.2. Let ^ be a probability measure on M. For /3 G [0, 1] and to > 0 
the following conditions are equivalent. 

(i) There exists a >0, such that p G 

(ii) For some h > 0, a < 1 and all x > m, 

p{[x -I- h/x^ , oo)) < ap{[x, oo)), 

/i((—00, —X — h/x^]) < a/r((—00, —a;]). 

Moreover, the eonstants in (i) depend only on the eonstants in (ii) and vice versa. 
For instance, if (i) holds, we can take h = and a = 1/2. 

Let us recall that a probability measure p on R” satishes the Poincare inequality 
with constant C for the class A of real valued functions on R" if for all f £ A, 

Var/(A)<C'E|V/(A)|2, 

where A is a random vector with law p. The measure p satisfies the log-Sobolev 
inequality with constant C for the class A, if for all f £ A, 

Ente^^'^) < C'E|V/(A)pe^(^), (1.1) 

where Ent Y denotes the usual entropy of a non-negative random variable Y, i.e. 

Ent Y = EE log Y — EE log EE 

if EE log E < oo and Ent E = oo otherwise. 

We remark that if A consists of all smooth functions, one usually states the 
log-Sobolev inequality in the equivalent form 

Ent/(A)2<DE|V/(A)|2, (1.2) 

where D = AC. We have decided to use the form uni), since our main motivation 
is the study of concentration properties for convex functions (note that for A being 
the class of convex functions (ED and (ED are no longer equivalent). 
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Using Proposition 11.21 and the results from one easily obtains that the con¬ 
dition fi S j\4o(m,a) for some m,a is equivalent to the Poincare inequality for 
convex functions and further to dimension-free subexponential concentration for 
convex Lipschitz functions on R" equipped with the measure /x®". On the other 
hand as shown in [T] the condition A4i(m, a^) implies the log-Sobolev inequality 
for convex functions and a dimension-free subgaussian inequality for upper tails of 
convex Lipschitz functions on see Corollary II.71 below. 

In what follows || • ||p denotes the £p norm, i.e. for x = (xi,... ,Xn) G R", 
||x||p = (|a;i|P -f ... -f \xn\^Y^^- For p = 2 we will write simply |a;| instead of ||x|lp. 
The symbol C{X) stands for the law of a random variable X. 

Our main result is the following theorem. 

Theorem 1.3. Let Xi ,..., be independent random variables such that C{Xi) G 
and letp: R” —>■ R 6e a smooth convex function. Then, for (3 € (0,1], 

Ente‘^('^o-.x„) 

< C(/3,m, a)E(|V^(Xi,... V ||V^(Xi,... 

If ft = 0 and \diip{x)\ < l/(2(m -|- 3 ct)) for i G {1,... ,n} and all x G R", then 

Ente‘^(-^i’-’'^") < CiP = (1.4) 

Remark 1.4. Note that in the case /3 = 1, CH) reduces to the classical log-Sobolev 
inequality 

Ent < C{m, fT)E|Vv?(Xi,..., 

The case (3 G (0,1) corresponds to the modified log-Sobolev inequality introduced 
and studied by Gentil, Guillin, Miclo [5] for the class of smooth (not necessarily 
convex) functions. The inequality dm) for smooth functions was introduced and 
studied by Bobkov and Ledoux [3]. 

Remark 1.5. In the above theorem and the following corollaries the assumption 
about convexity of the function ip can be relaxed to separate convexity (cf. [15)1. 

By a version of Herbst’s argument we obtain the following corollary, concerning 
the tail behaviour of smooth convex functions. Below C denotes the constant 
C{I3, TO, cr) from Theorem 11.31 

Corollary 1.6. Let Xi ,..., be independent random variables such that C{Xi) G 
and let <p\ R” —>■ R fee a smooth convex Lipschitz function. Then, for 
(3 G (0,1] and all t >0, 

P((/?(Xi, . . . , Xn) > E99(Xi, ..., Xn) + t) 

/ 3 . r t^ IN 

< exp ( - - mm 1^—. 

If j3 = 0, then for all fe > 0, 

P((/?(Xi, . . . , Xn) > ¥,ip{Xi, . . . , Xn) + t) 

“ ^ ( 4 ' { Csup |V(/3P ’ (to-I- 3cr) sup ||V(/7||oo 

Using standard smoothing arguments one can also obtain a result for not neces¬ 
sarily smooth functions, expressed in terms of their Lipschitz constants with respect 
to the £2 £'^_^_p 


norms. 
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Corollary 1.7. Let Xi,..., Xn be independent random variables such that C{Xi) G 
and let R” —>■ K &e a convex function such that 

\ip{x) - ip{y)\ < L 2 \\x-y \\2 

and 

\^{x) - ^p{y)\ < Li+pWx - yWi+fi 

for all x^y G and some L 2 ,< 00 . Then, for jd G (0,1] and all t >0, 

/ 3 ( '1 \ 

¥{ip{Xi,...,Xn) > + i) < exp 

If P = 0, then for all t > 0, 

, X„) > mx ,,..., X„) + f) < exp ( - i mi„ { . 

Finally, we can express concentration in terms of enlargements of convex sets. 
The proof of this corollary is a modification of the argument for smooth functions, 
taking into account some additional difficulties related to the fact that in the convex 
situation one cannot truncate the function (as this operation destroys convexity). 

Corollary 1.8. Let Xi ,..., Xn be independent random variables such that C{Xi) G 
and let A G R" be a convex set with P((Xi,..., X„) G x4) > 1/2. 
Then, for all r > 0, 

P((Xi, ...,Xn)^A + r^^^B2 + 

for some constant C depending only on /3 and C{P,m,a) from Theorem \l.A (and 
additionally on m + 3a in the case P = 0). 

The Reader may compare the above corollaries with a general result stated in 
[H Proposition 26, Theorem 27] about consequences of inequalities similar to (11.31) 
in the classical setting of smooth functions (see also [5J Example 29]). 

1.3. Relation to other results, and further questions. Recently, the classical 
log-Sobolev inequality for convex functions has been shown to be equivalent to 
certain transport-entropy inequalities with weak transport cost m- In Propositions 
O and 13.61 we show that this is also the case for inequalities dESl) and dnj). 
Since the precise formulation of these results requires introducing additional, rather 
involved notation, we postpone it to Section [31 

Logarithmic Sobolev inequalities for convex functions have one weakness when 
compared to their counterparts for smooth functions, namely they provide only 
deviation inequalities for the upper tail of the function (in the classical setting one 
obtains bounds on the lower tail by applying the inequality to —tp, an approach 
which is precluded in our situation, since except for trivial cases —ip is not convex). 

It turns out that for /3 = 0 the condition p, G Mp{m, cr^+^) is in fact equivalent to 
two-sided concentration inequality for convex functions. This is a consequence of a 
recent result by Feldheim, Marsiglietti, Nayar, and Wang [7] who proved an infimum 
convolution inequality, which as follows from our Proposition 13.61 is stronger than 

dSl). 

1.3.1. A remark about recent improvements by Gozlan et al. During the review 
process of this manuscript a preprint m by Gozlan, Roberto, Samson, Shu, and 
Tetali appeared, in which the Authors extended the results by Feldheim et al. and 
characterized the infimum-convolution inequality (equivalently weak transportation 
inequality) for probability measures on the real line and a large class of cost func¬ 
tions. Their condition can be shown to be strictly weaker than ours, while (as 
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follows from Proposition 13.II) the inequalities they consider are stronger than mod¬ 
ified log-Sobolev inequalities investigated by us. The condition of m is expressed 
in terms of the monotone transport map U between the symmetric exponential 
distribution and the measure fi and in the case we consider reads as 

sup{U{x + u)- U{x)) < C{1 + 

for some C < oo. Using the same arguments as in Lemma 12.21 below, one can show 
that this is implied by /r G Mp{m, cr^+^) for some m, On the other hand, for 

measures supported on the positive half-line, the latter condition is equivalent to 

sup(U(a; -f - U{xY+^) < D{1 + u) 

for some D < oo and so it is not difficult to find a measure which satisfies the 
condition of m but does not belong to A^/ 3 (m, for any m, cr^+^. 

To the best of our knowledge, except for the case of /3 = 0, the characterization 
of measures on the line satisfying the modified log-Sobolev inequality for convex 
functions is still unknown. For /3 = 0 these are exactly the measures p,, which 
belong to the class A4o(m,a) for some m and a (this follows e.g. from Theorem 
o and the characterization of the convex Poincare inequality due to Bobkov and 
Gotze 0). We provide a more detailed discussion of the results of izma and their 
relation to our results in Section [3] (see Remark [3^ . 

1.4. Organization of the paper. Section [2] is devoted to the proofs of our main 

results presented above. In Section [3] we discuss connections with transportation 
inequalities and in Section 0] we briefly comment on the relation between the class 
Mp{m, and the class of functions satisfying the modified log-Sobolev inequal¬ 

ity for all smooth functions. 

1.5. Acknowledgments. We would like to thank Ivan Gentil and Arnaud Guillin 
who back in 2006 during a trimester Phenomena in High Dimensions held at the IHP 
in Paris introduced the first-named author to their theory of modified log-Sobolev 
inequalities and raised a question concerning their convex counterparts. Some ideas 
used in the proof of the /3 = 0 case of our results originated in those conversations. 
We also thank Piotr Nayar for communicating to us the recent results of [7]. Finally, 
we thank the Anonymous Referee for a very careful reading of the first version of 
this article, for pointing out that it is possible to obtain hypercontractive estimates 
in Section [21 and for many helpful comments. 

2. Proofs of the main results 

2.1. Technical lemmas. In this section we will state and prove several techni¬ 
cal lemmas which will be used in the proofs of our main results. Of particular 
importance is Lemma 12.41 which constitues the core of the proof of Theorem 11.31 
In the proofs we will use the notation C{a) to denote constants depending only 
on some parameter a. The value of such constants may change between occurrences. 
Let us start with the proof of Proposition II.21 stated in Section o 

Proof of Provosition \T7^ Assume (i) holds. For x >m we have 

a^~'''^fj,{[x,oo)) > iy+{[x,oo)) > / fi{[y,oo))dy 

J X 

> - p — • x^y.{[x + 2(T^+^/a;^, oo)), 

which clearly implies the first inequality of (ii). The second inequality follows 
similarly. 
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Suppose now that (ii) is satisfied. For x > m define the sequence oq = x, 
On+i = an + h/a^. It is easy to see that this sequence is increasing, a„ —>■ c» and 
a„+i/a„ —>■ 1. Therefore 

poo ^ 

j/+([a;,oo)) = / y'^/r([y,oo))(iy < ^(o„+i - a„)af+i^([a„,oo)) 


n—0 


< iC V a”/r([ao, oo)) = —oo)), 

‘ ^ — rv 


n—0 


1 — a 


where K = sup (a„+i — a„)af_|_;^ = hsup (a„+i/a„)^ = /isup(l + < 

h{l + . We can proceed analogously to obtain the condition on the left 

tail. □ 

Lemma 2.1. Let ^ Then for all functions /:!£—>■ [0, oo) 

which are nonincreasing for x < xq and nondecreasing for x > xq (for some xq £ 
[—oo, oo]^ we have 

nOO P 

f{x)x^^{[x,<x>))dx<(T^^^ / f{x)dfi{x), 

Jr 

where fh = mM j (m V ((2/3)^/(^+^V))^. 

Proof. First notice that the inequalities of Definition 11.11 are also satisfied for sets 
of the form (a;, oo), x > m. We have 

pOO pOO pOO 

I f{x)x^fj,{[x,oo))dx= I / l[s<f{x)}X^T{[x,oo))dsdx 
J m J m Jo 

poo 

= / > m : f{x) > s})ds. (2.1) 

Jo 

The set A = {x > fh : f{x) > s} is either a (possibly empty) half-line contained in 
[m, oo] or a disjoint union of such a half-line and an interval / with the left end equal 
to fh. In the former case we have r'~^{A) < ^{A) < g.[{x £ M : f{x) > s}). 

In the latter case we denote the right end of I by t. Let a = mV ((2/3)^/(^+^V), 
so that fh = a + fa^. Since t > fh> a we obtain 

< i^'''([fh, oo)) < fj,{[fh,oo)) < gL{[a,fh)) 

< cr^+V([a,0) < cr'^+V({a: < t : f{x) > s}), 

where the second inequality follows from the assumption /i £ A4p{m,cr^~^^), the 
third from Proposition 11.21 and the definition of fh (which imply that /i([a,m)) = 
/i([a,oo)) — /i([TO, oo)) > /J,([m, oo))), and the last one from the observation that 
Xq >t and thus the function / is nonincreasing for x < t. 

Hence, in both cases we have i^~^{A) < a^^^fi{{x £ K.: f{x) > s}). Now we can 
write 

^/5+i 


f gL({x £ R : f(x) > s})ds = f f{x)dn{x), 

Jo Jr 


□ 


which together with (EH ends the proof. 

Lemma 2.2. If X is a random variable such that C{X) £ then 

P(|^| >t)< for all t > 0. 

Proof. Obviously it is sufficient to prove the inequality for t > Am. Define con¬ 
tinuous functions g{x) = y^¥{X > y)dy and f{z) = x^g{x)dx. By the 

definition of Al/ 3 (m, ct^“'"^), for x> m we have 

g{x) < cr^+^P(X > x). 
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Thus, for 0 > 


poo poo 

f{z)= / xf^g{x)dx< / x^a^+^f{X>x)dx = af^+^g{z). 

J Z J Z 


We can rewrite this as 


f{z)<--^^z^f{z), z>m, 


which gives f{z) < Cexp(— z^+^/((/3 + l)cr^+^)) with C depending only on /3, 
m and (note that f{m) < Now, as g is nonincreasing, we have 

f{z) > ^ y^g{y)dy > z^^^g{2z) and hence for z > to we get 


gi2z) < 4^ < + 1)^'"+')) 


?/3+l 


r/^+1 


Similarly 


P(X > 4x) < + 1)^^+')) 


(2x)/5+i - 2/5+ia;2/3+2 

for X > m. 

We can analogously deal with the lower tail. 


□ 


The next two lemmas contain the core of our argument and are counterparts of 
Lemmas 4 and 3 in [T] . While Lemma 12.31 follows quite closely the argument from 
[I], the subsequent Lemma [2.41 requires for /3 < 1 a significantly new approach. 

Lemma 2.3. Let tp: R —> R &e a smooth convex Lipschitz function, nonincreasing 
on (—00,0), and X a random variable with C{X) G Then there 

exists a constant C3{/3,m,a) such that 


{(ai,'y)GlR^:xy<0)} 


(fi'{x)(p'{y)e‘^^'^'^]P{X < X A y)^[X > xV y)dxdy 


< C3{j3,m,a)¥,Lp'{Xfe‘^‘^^\ 


Proof. For /3 > 0 we will assume without loss of generality that to > 1. Let to be 
the constant defined in Lemma [2.11 (note that to > 1). For x < 0 < y we have 
either (p'(x)ip'(y) < 0 or ip'(x) < (p'{y) < 0 and ip{x) > (p{y), so 

pO poo 

/ / ip'{x)ip'{y)e'^’'y'>¥{X < a;)P(X > y)dydx 


' —oo Jo 


< 


ip'{xfe^^^'^f‘{X < a;)P(X > y)dydx 


' —OO J 0 


pO 

<C{l3,m,a) / (p'{x)'^e‘^^^'*F{X < x)dx 

J —OO 

where the last inequality follows from the fact that by Lemma \T. 

pOO 

/ F{X > y)dy = EX_|_ < C'(/3,to, a). 

Jo 

The integral over the set where x > 0 > y can be treated similarly, namely we have 

poo pO 

/ / (fi'{x)(p'{y)e‘^^^'^F{X < y)F{X > x)dydx 


ooo pO 


y}'{yfe'^^y'>F{X < y)F{X > x)dydx 

Jo J —oo 

< C(^, TO, cr) / ip'{iiYe'^^y"'F{X <y)dy. 

J — OO 
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Now, by Lemma l2.ll (note that —X G and the function x H> 

is nonincreasing for x < xq and nondecreasing for x > Xq for some 

xo G [-00,0]), 



(p'(x)^e‘^^^^F(X < x)dx < 



(fi'(x)'^e‘^^^\—x)^F{X < x)dx 


< cr^+^E(^'(X)2e‘^W. 


Moreover 

rO 


f ip'{x)‘^e‘^^^''¥{X < x)dx = E / ip'{x)^e‘^^^^l[x<x}dx < rnEip'{X)^e‘^^^\ 

J —m J —rh 


where the last inequality follows from the fact that ip'{xY< p'{XY for 
X < x <0. This ends the proof of the Lemma. □ 


Lemma 2.4. Let tp: M —>■ R &e a smooth convex Lipschitz function and X a random 
variable with C{X) = p G Aipim, // /3 > 0, then there exists a constant 

C'4(/3, TO, cr) such that 



p'{x)p'{y)e‘^^''^^P{X < X A y)P{X > xV y)dxdy 

< C4(^,TO,a)E(|v.'(X)|2 V \p'{X)\‘^)e 




( 2 . 2 ) 


///3 

that 


0 and p' < 1/(2(to + 3ct)), then there exists a constant CY/d = 0,to,ct) such 



p'{x)p'iy)e'^^y^FiX 


< X A y)F{X > x V y)dxdy 


< CYP = 0,TO,CT)E|v9'(X)|2e‘^W. 


Proof. Let us first notice that the left-hand side of f|2.2D is equal to 
ry 

I p'{x)p'{y){e^'^'''^ + e‘^(^))P(X < a;)P(X > y)dxdy 
0 

(since on the set {(x, y) : x > y} we can make a change of variables swapping x 
and y). 

Since p is convex there exists a point xq (possibly 0 or infinity) at which p attains 
its infimum on [0, ooj. Let to be the number defined in Lemma l2.11 Obviously, for 
P > 0 we can assume that to > 1 (note that we do not change the value of m in 
the case /3 = 0). We split the outer integral into integrals over intervals (0, xq A to), 
(xo A TO, Xo V to) and (xq V m, oo). 

Step 1. Since p is nonincreasing on the interval (0, xq A to) we have 

p'ix)p'iy)ie‘^^='^ + e^(^)) < 2p'{xY 



for 0 < X < y < Xo A TO. Therefore 


f'XoAm ny 


< 


(^'(x)(/?'(y)(e^(“) + e^(^))P(X < x)P(X > y)dxdy 

/•xoAfh ry 

2 / v9'(x)2e^("^)p(X < x)P(X > y)dxdy 

Jo Jo 


< 2mE 


f'XoAm 


p'{xYe‘^^^'’^{x<x}dx < 2mPEp'(XY 


(2.3) 


where the last inequality follows from the fact that p'{xY < p'{XY for 
X < X < Xq. 
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Step 2 . To estimate the integral over the interval (xq A m, xq V m) we consider 
two cases. If xq < fh, then 


l-m i-y 

I Xq Jo 


< x)P(X > y)dxdy 

rm py 

/ (/?'(?/)^e‘^^^^P(X > y)dxdy 


< 2 


pm 

< 2 mE (p\y)^e‘^^yh{x>v}dy ( 2 . 4 ) 

J Xo 


If Xq > m, then similarly as above (the third last passage follows by Definition ll.l 
and the last by Lemma I^Tl for the third passage recall also that m > I for /3 > 0 ) 


Xo rv 
rh Jo 

<2f 


= 2 

< 2 


ip'{x)y)'{y){e'^^^'^ +e‘^^^^)P(X < x)P(X > y)dxdy 
Co rv 

I p'ixfe'^^^^EiX < x)P(X > y)dxdy 
p'{xfe^^^'>F{X < x)F{X > y)dydx 

poo 

p'ixfe'^^^^PiX <x) I /P(X > y)dydx 

J x\/rh 


/ m Jo 

pxo pxo 


0 Jx\/m 

rXQ 


fXQ 

< 2 fT^+i / p'{xfe‘f’^^'>F{X < x)P(X > x)dx 

Jo 

a m pXQ 

p'{x)'^e'^^^^F{X < x)dx + J p'{x)‘^e‘^^^'^x^F{X > x)dxj 


< 2a^+^(m + a/^+^)Ep'(Xfe'^(^l 


( 2 . 5 ) 


Step 3. It remains to estimate the integral over the interval (xq V m,oo). In 
what follows we can assume that xq < oo and p(xo) — 0 (since if we subtract 
p(xo) from p both sides of (12.21) will change by a factor of exp(—y)(xo))). Fix 
5 = l/(2(m + and let xi = sup{x > xq : p'{x) < d} (with the convention 

that xi = Xq if p'(xq) > 5). Since p'{x) < 0 < p'{y) for 0 < x < xq < y we have 


L 


p'{x)p\y){e^'^^'> + e‘^^y^)F{X < x)P(X > y)dxdy 


xoVm Jo 


< 


L 


p'{x)p'{y){e‘^^^'> + e‘^(«))P(X < x)P(X > y)dxdy 


XQ\/m J xq 


< 2 

= 2 


xoVm J xq 

pOO 


p'{x)p'{y)e'^^'‘^'^F{X > y)dxdy 
v{y)v'{y)e-'^^'^^F{X > y)dy 


J XoVm 

pX\Vm 

= 2 / ^ p{y)p'{y)e'^^y'^F{X > y)dy 

J XoVm 


+ 2 


y^ivW{y)(V’'"'^^F{X > y)dy. 


( 2 . 6 ) 


/ x\Vm 


We will estimate the two integrals from ( 12 . 61 ) separately. 

Step 3 a. To estimate the first integral we define a convex function 
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The function tp is C^-smooth and 0 < ip'{x) < S tor x £ (xo,oo). Moreover 
ip{x) = (p(x) and ip'(x) = ^'(x) for x £ (xq V in, xi V m) (since this interval is 
contained in (xg, xi) or is an empty set). Also 0 < (p(x) < ip{x) and ^'{x^ < ‘^'{xY 
for a; G K. Hence (the third inequality follows from Lemma [2Tll 

^xiVm pca 

/ ^ ‘/5(?/)(^'(j/)e‘^(^¥(A > y)dy < ip{y)lp'{y)e'^'^^'>F{X > y)dy 

JxoVm JxoVm 

1 /*oo 


< 


< 


< 


xo Vm 

/3+1 r° 


T-/3+1 


{ifiyf + ip'{yf)e'^^y'>y>^¥{X > y)dy 
) 

ixiv? + ^'{yf)e^^y'‘dy{y). 

Xq 

/3 + 1 


(2.7) 


2 ~' 2 
In order to complete the estimation of the first integral we have to deal with the 
expression Fip{XYe^^^'>l{x>xo}- Using integration by parts (recall that ^{xg) = 
0), the Cauchy-Schwarz inequality and the fact that (f' is bounded by <5 we can 
write 

E^(A)2e^Wl{x>.o} 


= 2 

< 2 


(p'{x)(p{x)e'^^^'^F{X > x)dx + / p{xYp'{x)e^^^^F{X > x)dx 

J Xq 

1/2 


O noo ^ 2^/2 , poo ^ 

> x)dx\ f / ip'{xY> x)dx 

Xq '' ^Jxq 

pOO 

+ S ip{x)'^e^i^'^¥(X > x)dx. 

J Xq 


( 2 . 8 ) 


We can now use Lemma lOI to estimate the above integrals by expressions of the 
type E^(A)^e^(^H|x>a;(,} and E(^'(A)^e‘^^^^ respectively. For example 

poo 

/ (p{x)‘^e^^^^]P{X > x)dx 

d Xq 

pXQVrh poo 

<E / ip{x)^e^i^'^l[x>x}dx + / ip{x)'^e^i^'*xdF{X > x)dx 

^ Xq ^ XQVrh 

< {m + a^+^)Ep{X)‘^e‘^i^h^x>XQ} 

and similarly (at the end we skip the indicator function and replace ip by p) 

poo 

/ ^'(x)^e^U)p(x > x)dx < (to + {XY 

J Xq 

(note that we use the monotonicity of ip, ip' to assure that the assumptions of 
Lemma [2Tl are satisfied). Plugging this into (12.811 gives us 


■(e^'(x) 


2gV^(A) 


E^(A)2e^(^)l|x>.„} < 2(TO + a^+i)(E^(A)2e^W 

+ 6{m + uP+^)Ep{Xfe^i^h{x>XQ}, (2.9) 

which we solve with respect to Eip{XYe^i^'>l{x>xo} and (since (5 (to + (t^+^) = 1/2) 
arrive at 

E^(A)2e^(^)l{x>xo} < C{P,m,a)Ep'{XYe'^i^\ 

This allows us to finish the computations of (EB and finally conclude that 

^xi Vm 

p{y)p'(y)e'^iy'>F{X > y)dy < C{^, to, a)Ep'(X)^ (2.10) 


L 


XQVm 
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This ends the estimation of the first integral from (12.61) . 

Step 3b. To complete the proof we have to estimate the second integral from (1^ . 
Notice that if /3 = 0, then cci = oo since by assumption (p' < 6 = l/(2(m + cr)) = 
l/(2(m + 3cr)) and in this case that integral disappears. Therefore we can assume 
that /3 > 0. 

Fix e > 0. By convexity p{yY < p'iu)^{y — ■ Using Lemma I^TT] 

and Young’s inequality with exponents 1/(1 — (3) and 1//3 we get 


V>(yW > y)dy 


' Vm 


< 


l{j/e(xiVm,oo)}V5(2/)^ '^p'(yf"^^e'^^y^yl^V(X > y)dy 


( 2 . 11 ) 


< cr^+i / l{ye(xiVm.oo)}<p(j/)^ ^p'(yY+^dp(y) 

JR 

< £(1 - p)aP+^¥.p{X)e^^^h{x>.,yfk} 

+ CT^+^C'(e, ^)E| v?'(Y) I ^ 

We will estimate Ep{X)e‘^^^'>l^x>xiVfh} by a sum of expressions which appear 
in the assertion of the lemma and > y)dy. Then we will 

pick e = e(/i, m, a) small enough to get from (12.lip an estimate of the integral 
I^yfh ‘P(.y)‘P '^ y)dy which will end the estimation of the second inte¬ 
gral from (1^ and the proof of Lemma 12.41 Integration by parts gives 

Ep(X)e^^^h^x>x,wfh} = V > xi V m) 

( 2 . 12 ) 


p'iy)e‘^^y'^F{X > y)dy + / p{y)p'{y)e‘^^y'>F(X > y)dy, 


' x\\/m 


' xi Vm 


so it remains to estimate (^'(y)e‘^^^^P(Y > y)dy and the boundary term 

p(x\ V m)e‘^^^'^'^'^'>F(X > xiM fh). Since m > 1 and for x> Xi we have pYx) > 
we can write 


OO -j ^OO 

^ p'{y)e'^^y'^F{X > y)dy < t / ^ p'(yfey'^y'>y^F{X > y)dy 

1 Vm ^ J xiVm 


t/3+1 


< 


E(^'(Y)2e‘^W, 


As for the boundary term, we write it as 

E(((^(a:i V + 1) + - 1))l{x>.iv*}- 

It is easy to deal with the second part, since 


^2 


(2.13) 

(2.14) 

(2.15) 


To estimate the first part of the boundary term, we use the fact that ((/je”^—e”^-!-!)' = 
p'pe^ and p{xo)e'^^^°'' — -1-1 = 0. Therefore 


E(p(xi V m)e 

Vm 


ip{x\\/m) _ g</?(a;iVm) 


+ l)l{A>a;iVrfi} 


= E 


< 


px\ vm 

/ V(y)v5(y)e‘^^^^ciyi{A>a:ivm} 

J Xo 


Xq 

xi Vm 


(y)'y(y)ey'‘''^'^^(X > y)dy. 


(2.16) 
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Notice that we already estimated (f'{y)(p{y)e‘^^y'^¥{X > y)dy in Step 3a (see 

(I2.10ll 'l. so it is enough to estimate the integral over the interval (ccojXo V m). By 
convexity of we get 

nXoVrfi pXoVrfi 

/ >y)dy < / (p'(y)'^ {y - xo)e‘^^'^^'P{X > y)dy 

Jxq Jxq 

pXoVfh 

<mE p'{y)‘^e‘^^yh{x>y}dy<m‘^Eip\X)‘^e'^^^\ (2.17) 

J Xq 

where the last inequality follows from the fact that p'{XY> ip'(y )^for 
X > y > Xq. This (together with the previous inequalities (I2.14L (12.151) . (12.161) 1 
completes the estimation of the boundary term from (I2.12L i.e. we get 

p{xi V > a:i V m) < (-^ + + C[j3,m, a))¥jp'[X)"^ 

Therefore we get from (12.121) . (12.131) the announced estimate 


E^(X)e‘^Wl{;f>,,vm} < C(/3,m,a)E^'(X)2e^(^) 

pOO 

+ _ p{y)p'{y)e‘^^y'^F{X > y)dy. 

J x-]_ Vm 

As explained above, plugging this into (12.111) and taking e small enough, we deduce 
that 

POO 

/ ^ p{y)p'{y)e^^y^P{X > y)dy < C(/3,m,a)E(|(^'(A)|^ + (^'(A)")e‘"W 

J ail Vm 

(2.18) 

which ends the estimation of the second integral from (j2.6l) (and finishes Step 3b 
and also Step 3). 

Bringing together the above results of Step 1 (see (12.31) 1. Step 2 (see (12.41) . (12.51) 1 
and Step 3 (see (12.6p . (12.101) and (12.181) 1 completes the proof of the lemma. □ 

2.2. Proofs of Theorem ITT^ and Corollaries IIII.71 fTTSl 


Proof of Theorem 17.,71 We will follow Ledoux’s approach for bounded random vari¬ 
ables. Due to the tensorization property of the entropy (see e.g. [ini Chapter 5]) 
it is enough to prove the theorem for n = 1. Also, by a standard approximation 
argument, we can restrict our attention to convex Lipschitz functions only. Let Y 
be an independent copy of X. By Jensen’s inequality we have 

Ente‘^(^) = Fp{x)e^^^'> - logEe’^^'^^ 


<iE(^(A)-^(y))(e‘^W-e‘^(^)) 

= E((p(A) - ^(r))(e‘"W - e^^^^)l{x<Y} 

fY pY 

= E / ip\x)dx / ^\y)e'^^'^^dy\{x<Y} 


fx 


fx 


f [ (p\x)(p\y)e'^^'^^¥{X <xf\ y)'P{X > x V y)dxdy. 

Jr Jr 


We split the double integral into four integrals depending on the signs of x and y 
and use Lemmas 12.41 and 12.31 to obtain the desired inequality - note that C{—X) G 
Mp{m, cr^+^), so we can assume that the infimum of p is attained at some point 
of [0, oo] (possibly at oo), which in particular means that the assumptions of 
Lemma IQ are satisfied. Moreover, we can use Lemma 12.41 to handle the inte¬ 
gration over (—00,0)^ (again by a change of variables and the fact that £{—X) G 
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In the proofs of the corollaries we will use some additional notation and obser¬ 
vations. 

Remark 2.5. If we substitute instead of ip into dia and divide both sides by 
tending to zero, then we recover the Poincare inequality for convex functions: 

Var ..., X„) < 2C(/3, m, a)E|V(^(Xi,..., X„)p. 

A standard approximation argument shows that also for L-Lipschitz (but not nec¬ 
essarily smooth) convex functions we have Var(/5(Ai,..., A„) < 2(7(/3, m, 

Definition 2.6. For jS G [0,1] let Hp : R —?> [0, oo] be the function given by Hp{t) = 
max{t^, and let : M —>■ R be its Legendre transform. 

For /3 = 0 the definition of Hq should be understood as Ho{t) = + 

ooI{|t|>i}. Similarly, all indeterminate expressions below should be interpreted as 
appropriate limits for /3 —>■ O'*". The next lemma sums up some properties of the 
functions Hfj and H^. 

Lemma 2.7. (a) The function is given by the formula 

rtV4 */0<|t|<2, 

H;it)= l\t\-l ^f2<\t\<^^, 

(b) We have H^{t) > ^min{t^, 

(c) The derivative of the function Hp is given by the formula 

pt|sgn(t)/2 i/0<|t|<2, 

^H*^{t)= \sgn{t) */ 2 <|t|<m, 

[(;3fTl^l)'^sgn(t) i/|t|>^. 

(d) We have f^H*p{t) < < min{|t|, \tf}. 

Sketch of the proof. A straightforward calculation gives (a); (c) and (d) follow di¬ 
rectly from it. To prove (b), first notice that for every /3 G [0,1) there exists exactly 
one strictly positive number to such that tg/d = Since we have 

/3g[o.i] to/4 te[2,4] t^/4 

and the functions t^ and \t\^^^ are convex, we conclude that 

H}{t) > ^min|tV4, ^ min{tMt|i+'^}, 

where we also used the fact that 

inf =1/4- □ 

/3g[0,1] p\f3 + l) 

Proof of Corollary ] 1.6] for /3 > 0. We follow the classical Herbst argument (see e.g. 
[IS])- Denote A = sup{|V(/j(a::)| : x S R"}, B = sup{||V(/j(a:)||(/ 3 _|_i )//3 : x S R”} and 
for A > 0 define $(A) = . Then 

Ad>'(A) = EA^(Ai,..., 

and hence 

Ad>'(A) - d>(A)log$(A) = Ente^‘^(^i' 

< Ce((A|V(p(Ai, ..., A„)|)2 V (A||V(p(Ai,..., 
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After dividing both sides by A^<i)(A) we can rewrite this as 
(ilog$(A)y < ^((AA)^ V 

Since the right-hand side is an increasing function of A and lini>,^o+ ylog$(A) = 
E(^(Ai,..., A„), we deduce from the last inequality that 

i log$(A) < E^(Xi,..., X„) + C{{AXf V {BX)^^+^'>/^)/X, 

which is equivalent to 

EgAv(Xi,....x„) < exp(AE(/j(Ai,...,A„) -f C((AA)2 V {BX)^^+^'>/l^)). 


Therefore from Chebyshev’s inequality we get 


P((^(Ai, . . . , Xn) >1 + E(/?(Ai, . . . , Xn)) < 


]ggAv(A:i....,X„) 

exp(AE(p(Ai,..., Xn) + At) 


< exp(-At + C((AA)2 V {BX)^^+^^/^))). 


Now we can optimize the right-hand side with respect to A. Let K and L be such 
that A = K^/^L, B = We have 

((AA)2 V {BX)^^+^^^^)) = K{{LX)^ V = KHp{LX) 

and hence 


P((^(Ai, ...,Xn)>t + E(^(Ai,..., Xn)) < eM-CKH*p{t/CKL)). 

Using Lemma 12.71 and the definitions of K and L we get 

/ 3 r t^ t^+^ '1 \ 

P((/?(Ai,...,A„) > t-hE(p(Ai,...,A„)) < exp(^- — minj^^, 
which is the assertion of the lemma. □ 


Proof of Corollary \T^ for /3 = 0. We proceed exactly as in the case /3 > 0, with the 
only difference that we have the additional constraint A < \/{2B{rn + 3cr)) since 
the gradient of the function Atp has to be small. We arrive at 

F{ip{Xi,... ,Xn) >t + E(^(Ai,... ,Xn)) < exp(-At -I- CA^X^). 

Now we can optimize the right-hand side with respect to A G [0, l/{2B{m + 3cr))]. 
For t/{2CA^) > l/{2B{m + 3(t)) it is best to take A = \/{2B{m + 3(t)) for which 
the polynomial becomes 

- At + CA^X^ = -tflflBim P 3a)) + CA^jlflBim + 3a)f 

< —t/{AB{m + 3cr)) = — min{t/(4B(m -f 3 (t)), t^/(4C'A^)}. 
For tl{2CA^) < l/{2B{m + 3a)) we put A = tl{2CA?), which gives 

- At -f CA^X^ = -t^l{2CA^) + t^l{WA^) 

< -t^HWA^) = -min{t/(4B(m + 3o-)),tV(4C'A2)}. 
The assertion of the lemma follows from those two inequalities . □ 

Proof of Corollary \1 .'1\ We will consider only the case /3 > 0, the case /3 = 0 is 
similar. The function Lp is differentiable almost everywhere (with respect to the 
Lebesgue measure) and 


{X(p{x), h) < ip{x + h) — ip{x) a.e. 
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After plugging h = V(p{x) and h = {\di(p{x)\^/^ sgTi{di(p{x)))'^^i, and using the 
Lipschitz conditions we arrive at 

\\Vip[x)\\l<Ll a.e. 

Let (fie be the convolution of tp and a Gaussian kernel, i.e. Ps{x) = V,p{x + ^G) , 
where G ^ Af(0,I). The function is smooth and convex, and inherits from p 
the estimates of the gradient (which are now satisfied for all x G R"). Therefore 
from Corollarv ll.61 we get 

¥{peiXi,...,Xn) > t + Ep^{Xi,...,Xn)) < Bxp (^ - — min|^^, 

Moreover \pe{x) — p{x)\ < L 2 -\/e]E||G ||2 and hence pe converges uniformly to p as 
e tends to zero. This observation ends the proof of the lemma. □ 

We will now pass to the proof of Corollary 11.81 

Proof of Corollary We will present a detailed proof only for the case /3 > 0, the 
case /3 = 0 is similar. Throughout the proof we denote by C{f3,m,a) the constant 
from Theorem O and hy Xi, i = 1,... ,n, the coordinates of the vector x G K”. 
The function $(a;) = inf^g^ (xi — af) is convex. Therefore, the set 

n 

I = A + {x e R" : ^ H*^{xi) < 3r/16} = {x e R” : $(x) < 3r/16} 

i=l 

is open and convex. 

By Rademacher’s theorem, the function $ is differentiable almost everywhere 
(with respect to the Lebesgue measure) and hence we can choose a dense set Z? C A 
consisting of points for which V$ exists. We dehne 

p{x) = sup{$(y) + (V$(y), x - y)}. 

yGD 

This function is convex as a supremum of linear functions. Moreover < $ and 
(/? = $ on the set clA = {x S R" : $(x) < 3r/16}. 

Note that the set {x S R" : p{x) < 3r/16} is open and convex, contains the set 
A (since on A we have </? = $< 3r/16), and is disjoint with the set cl A \ A = 
{x G R" : <i>(x) = 3r/16} (since on cl A \ A we have p = ^ = 3r/16). Therefore 
{x G R" : p(x) < 3r/16} = A and hence 


{x e R" : p(x) < 3r/16} = A + {x S R" : ^ < 3r/16} 

2 = 1 
n 

C A + {x e R’^ : ^ x^ A |x,|i+^ < r} C A + r^/^B 2 + Bi+p, (2.19) 

2=1 

where we used Lemma UCf \ and the observation that if EEi ^ < r, then 

X can be written a,s y + z, y G r^^^B2, z G Bi+p (since we can put yi = xt 

if xf < |xi|^+^, yi = 0 otherwise, and Zi = Xi if xf > |xi|^+^, Zi = 0 otherwise). 

We will show that the inclusion 

{x S R" : p{x) < r/16 + Eiy9(Xi,..., X„)} C {x G R" : p{x) < 3r/16} 

holds for r big enough (greater than some constant depending only on G(/3, m, a)) 
and that p satisfies 

\p{x) - p{y)\ < L2 \\x - y\\2, \p{x) - p{y)\ < Li+/3||x - y||i +;3 
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with 

Ll = sup ||V$(y)||2 < 16r/3, = sup ||V$(y)||j;^ < 16r^/3. 

y^D ^ yeD 

By the preceding Corollary 11.71 the above claims together with (I2.19P will give 

P((Xi,..., X„) ^ A + ^1/2^2 + < exp(-CV) 

for r big enough and some constant C depending only on C{l3,m,a) and /3. By 
decreasing C we will be able to guarantee that this inequality holds for all r > 0 
(recall that P((^i,..., Xn) & A) > 1/2). 

We estimate the Lipschitz constants first. For y G D C Awe can choose G c\ A 
such that 

n n 

Hy) = /nf -Oi) = ^H*p{y, - a}(). 

“ i=l i=l 

Note that then y — G {a; € K" : Hp{xi) < 3r/16}. Let e* be the f-th vector 

of the canonical basis in R”. We have 

n n 

$(y + te”) - $(y) = mf ^ H*p{yj + tl{i=j} - Oj) - ^ - a]) 

1=1 1=1 

< Hliy, +t-a\)- H^iyi - of). 

The function t H> H^{yi + t — af) — Hp{yi — a/) — ^{y + te®) + $(y) attains its 
minimum for t = 0 and hence di^{y) = {H^y^yi — a\). Using Lemma I^TTl we arrive 
at |i9j^'(y)| < \vi - a\\ A \yi - a\\^. Therefore, 

n 

\\ym\\l<Y.^\y.-oy\^\y.-ayff 

n n 

<'^\yi- afy A \y,-afy+^ < 16^iL^(yi - a/)/3 < 16r/3 

and similarly 

n Q 

l|Vd>(y)||;;^i)/^ < (E A |y. - 

n B ^ 

< (E ^ \y^ - ^ (16E< 16ry3. 

i—1 1—1 

This finishes the estimation of the Lipschitz constants. 

In order to show that for r big enough we have the inclusion 

{a; G R” : ip{x) < r/16 + Wjip{Xi ,..., X„)} C {a; G R” : (p{x) < 3r/16}, 

we claim that E(/3(Xi,... ,Xn) < r/S for r > 2^^C'(/3, m, cr)/3. Indeed, if this was 
not the case, then we could write 

1/2 < P(^(Wi,..., < 0) < P(^(Xi,..., X„) < E(p(Xi,..., X„) - r/8) 

< P(|(/9(Xi,..., X„) - E(^(Xi,..., X„)| > r/8) < 64 Var ip{Xu ..., X„)/r2 

< 64 • 16/3 • 2C{fi, m, a)/r, 

where the first inequality follows from the fact that on the set A we have (/j = $ = 0, 
and the last inequality follows from the fact that y satisfies the Poincare inequality 
(see Remark l2.5p and y is y 16r/3-Lipschitz. This yields a contradiction. □ 
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3. A LINK TO TRANSPORT INEQUALITIES 

In this section we will present several equivalent formulations of modified log- 
Sobolev inequalities for convex functions in terms of transportation inequalities or 
T-log-Sobolev inequalities, in the spirit of m Theorem 6.8]. We would like to 
emphasize that our arguments in this part of the article follow quite closely the 
approach in m (in fact the authors of m , while working with the classical convex 
log-Sobolev inequality, i.e. /3 = 1, mention the possibility of extending their results 
to more general cost functions). In particular we do not claim any novelty of the 
methods we use. 

Our main motivation for providing an equivalent formulation of modified log- 
Sobolev inequalities for all (3 G [0,1] is the hope that it may help find a character¬ 
ization of measures satisfying the inequalities in question. In fact, one interesting 
observation which follows from our considerations is that modified log-Sobolev in¬ 
equalities are equivalent to formally stronger results in which the entropy is replaced 
by a larger functional (see Remark 13.41) . 

Since we will rely heavily on techniques developed by other authors, we decided 
to present only the main steps and ideas of the proofs. We include numerous 
references to appropriate sources for the Reader interested in all technical details. 

Before stating the actual results we have to introduce some notation which we 
will use throughout this section. 

For X = (a;i,. .., Xn) G R" and /? G (0,1] (we will treat the case /3 = 0 later) we 
define 

n n 

= Y.^p{x^), Hf^*{x) =Y,H*p{x,), 

where is the Legendre transform of the function iL/ 3 (s) = V (see 

Definition 12.611 . For t > 0, /3 G (0,1], and a function /: K" —>■ R we define the 
infimum-convolution operator 

Qtf{x)= mi{f{y) + tHf'^*{{x-y)lt)}€ [- 00 , 00 ). 

Let Pi(R”) denote the class of all probability measures on R” with finite first 
moment. If /j, G Pi(R"), we define the transport cost 

T{y\v) =\-ni{ j H^^^*{x- [ ydpa;{y))diy{x)}, 

v/R" JR" 

where the infimum is taken over all probability measures tt on R" x R" with the 
first marginal equal to u and the second to /x; here Px denotes the probability kernel 
such that 'iT{dxdy) = v(dx)px{dy). 

If /: R" —>■ R is bounded below by an affine function, we define moreover 

R^f{x) = inf{ [ f{y)dp{y) + XH^^* {x - [ ydp{y))} G [- 00 , 00 ) 

P jR" jR" 

where the infimum is taken over all probability measures p G Pi(R"). Note that 

R^f < /. 

Recall also, that for two probability measures /x, u on R" the relative entropy of 
u with respect to p, is given by the formula 


iL(u|/x)=/ log(^)du 

j-^n \a^/ 


if u is absolutely continuous with respect to p; otherwise we set H(iy\p) = -l-oo. 

The main result of this section provides a characterization of the modified log- 
Sobolev inequalities with /3 G (0,1] for convex functions in terms of transportation 
inequalities. 
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Proposition 3.1. Let X be an integrable random vector with values in R" and law 
/j,. For P S (0,1] the following conditions are equivalent. 

(i) There exists b > 0 such that for all probability measures v on R", < 

bH{v\p). 

(a) For all s > 0 we have < oo and there exist X,D > 0 such that 

< DE{f{X) - f{X))e^<^^^ (3.1) 

for all functions f: R" —>■ R such that Ef{X)e^^^'^ < oo and f is bounded 
below by an affine function. 

(Hi) For all s > 0 we have < oo and there exists C > 0 such that 

< CEH^^\XipiX))e‘^’-^^ (3.2) 


for all smooth convex Lipschitz functions (p: R” —>■ R. 

(iv) For all s > 0 we have < oo and there exists B > 0 such that for every 

to € [0,5], every t > 0, and every convex Lipschitz function (p: R" —>■ R 
bounded from below, 


llfc(i) - Ir llfc(o)’ 


(3.3) 


wherek{t) = {l + B ^(t- to))l{t<io} + (1 + 5 ^(t-to))^l{t>to}- ^^ere, 

for a positive random variable Z, we use the notation \\Z\\n = exp(EloKZ) 
anrf||Z||fc = (EZ'^)!/* if k ^ 0 . 

Moreover, in each of the above implications the constants in the conclusion de¬ 
pend only on the constants in the premise (in particular they do not depend on the 
dimension). 


Remark 3.2. Using the terminology and notation introduced in m the above 
proposition can be stated shorter: (i) means that the transport inequality T^(6) 
holds (for the cost function c{x,p) = — ydpljj))] see [11] Definition 5.1]) 

and (ii) is the r-log-Sobolev inequality, denoted (r) — LSIc(A, D) (for the same cost 
function; see[TTl Definition 6.1]). 


Remark 3.3. The exponential integrability of the norm in items (ii)-(iv) is in¬ 
troduced to exclude heavy tailed measures for which the only exponentially in¬ 
tegrable convex functions are constants. Using the observation that R^f{x) = 
infy{(/j(i/) -|- XH^^*(x — y)}, where ip is the greatest convex minorant of / (see [TT] 
Corollary 3.11 (2)]), one can easily see that such measures trivially satisfy (13.111 with 
D — 1 and arbitrary A > 0, while they cannot satisfy the inequality of point (i), as 
it implies subexponential concentration. 


Remark 3.4. As we will see from the proof (see (13.51) 1. the conditions of the propo¬ 
sition remain equivalent if we replace the entropy on the left-hand side of the 
inequalities in (ii) and (iii) by the (greater) expression 


/ ipe^dpL — / (fdpL / e^dpL 
JIR”' 


]^E{ip{X) - ip{Y)){e^^^^ - 


(where X, Y are independent with law p). Therefore the first estimate from the 
proof of Theorem 1 1.31 is actually quite sharp. Note that there does not exist a finite 
constant C such that CEnt > E(A — Y){e^ — e^) for all bounded i.i.d. random 
variables X and Y (as one can see by considering e.g. X with the distribution 
^S-a + \5a for a sufficiently large). 

Remark 3.5. In the proofs we will use a dual formulation of (i): the transport cost 
inequality holds if and only if we have 


of) Qiip(x 


'>dp[x)) 


v{x)dfj.(x) 


< 1 


(3.4) 
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for all convex Lipschitz functions ip: R" —^ R bounded from below (see El Propo¬ 
sition 5.5 (ii”) and Corollary 3.11 (2)]; note also that Qi^p = since tf is convex). 

For /3 = 0 a result similar to Proposition 13.11 holds, but we have to introduce 
a slightly different notation which will additionally depend on a number <5 > 0. 
Namely, for a: = (xi,..., x„) G R" we define 

n n 

= E = E KsAA 

i^l i=l 

where i?o,5(s) = Ho{s/S) = iJ“^s^l{|s|< 5 } -b ool{|s|> 5 } and H*g{s) = = 

|:(5^S^1{5|s|< 2} -b ((5|s| — l)l{ 5 |s|> 2 } is its Legendre transform. We define Qt,s, Ts, 
Rg as Qt, T, R^ above, but with H^^g and H^^g* in place of and 
respectively, i.e. we put 

Qt,sf{x) = ini {f{y)+tH'^^}*{{x-y)/t)}, 

yeR" 

Ts{y\v) = ini{ j {x - j ydpa,{y))dv{x)}, 

^ Jvx ' Jvx 

Rsfix)= inf { / f{y)dp{y) + \H''^}*{x- ( ydp{y))}. 

pePl(R") jRn jRn 

We then have the following 


Proposition 3.6. Let X he an integrable random vector with values in R" and law 
p. The following conditions are equivalent. 

(i) There exist (5i and 6 > 0 such that for all probability measures v on R", 
TsiihW) < bH{iy\p). 

(ii) There exist (52 > 0, A,I1 > 0 such that Ee^ < oo and 

EntefW < T>E(f(X) - Rlf(X))eR^^ 

for all functions f: R" —> R such that Ef(X)eR^'^ < oo and f is bounded 
below by an affine function. 

(Hi) There exist 6^,0 > 0 such that Ee^ ^o.ss ^ qq 
Ente'^^'^i < CEH^'^l{Xip{X))e‘^^^^ 


for all smooth convex Lipschitz functions p: R” —>■ R with ||Vvj||oo < S 3 . 

(iv) There exist Si,B > 0 such that Ee “’'’■ 4 ^ < 00 and for every to G 

[0, B], every t > 0, and every convex Lipschitz function p: R" —>■ R hounded 
from below, 

Ir llfc(t) - Ir llfc(o)’ 

where k{t) = B~^ min{l -b B~^{t — to), 1}. 


Moreover, in each of the above implications the constants in the conclusion de¬ 
pend only on the constants in the premise (in particular they do not depend on the 
dimension). 


Remark 3.7. The dependence of the constants is the following: (i) implies (ii) with 
A G (0,1/5), D = 1/(1 — A5), and 82 = i5i; (ii) implies (hi) with C = XD and 
i 53 = d 2 (A A {2D)~^)-, (iii) implies (iv) with B = C M \ and 84 , = ( 53 ; (iv) implies 
(i) with b = B and i5i = ^ 4 . Note also that for a > 1 we have Ho^aS < Ho^s and 
Hq s — aS — AH^g, and hence if the inequalities in (i), (ii), (iii), (iv) hold for 
some (5 > 0, then they also hold for every 8 ' G (0, (5). 
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Sketch of the proof of Proposition \3.1\ For simplicity we assume n = 1. To prove 
that (i) implies (ii) we follow the proof of m Proposition 6.3]. Fix A S (0,1/6) and 
let /: R —)> R be a function with fe^dp. < oo. We define Uf to be the measure 
with density e^ dpi/e^dfi). Let tt be a measure on R^ with the first marginal I'f 
and the second p,. We have Tr{dxdy) = Vf[dx)px{dy) for some probability kernel Px 
and hence 

/ [ fdy= f {f{x) - f{y))'K{dxdy) = [ {f{x)- [ f{y)dpx{y))dvf{x) 

Ar Jr Jr2 Jr Jr 

< [ {f{x)-R^f{x))di^f{x) + X [ H*p{x- [ ydpx{y))dvf{x), 
JK Jr 

where the last inequality follows from the definition of R^. Optimizing over all 
measures tt as above gives us 

/ fdvf- f fdy< I {f{x)-R^f{x))diyf{x)+XTiyWf) 

Jr Jr Jr 

< [ (fix) - R^f{x))dvf{x) +XbH{vf\y). 

Jr 

It follows from Jensen’s inequality that 

H{vf\y)= [ log ( I- )dvf < [ fduf- [ fdy 
Jr Jr Jr 

and hence we conclude from the preceding inequality that 

/ - [ fdy<-J-j 7 [ {f{x)- R^f{x))dyf{x). 

Jr Jr -L — ao 

By the definition oi Vf this means that 

[ fe^dy - [ fdy [ e^dy < ^ f {f{x) - R^f{x))eddy{x). (3.5) 

Jr Jr Jr ^ ~ Jr 

Therefore dm is satisfied with O = 1/(1 — A6) (the above inequality is formally 
stronger, since its left-hand side is by Jensen’s inequality greater than Ente^; see 
Remark 13.41) . 

It remains to prove the exponential integrability of the norm (we present the 
argument on the real line, however it can be easily adapted to R", e.g. if we decide 
to work with the Pf norm all the calculations are performed on each coordinate 
separately; clearly the choice of the norm for this problem is irrelevant). 

Consider the function y{x) = r\x\. By the dual formulation of the transport 
cost inequality (see Remark 13.51) and the assumption that X is integrable, we get 
Egb Qiv>{x) ^ jg gg^gy ggg j. |^| gnough (dependmg on r) the 

infimum in the definition of Qiip{x) is attained for y = x — sgn(x)((iF^)')“^(r), 
where {{Hpy)~^ is the inverse of {Hp)' restricted to [0, oo). Thus for jccj large 
enough, we have Qiip{x) > r\x\/2. Since r is arbitrary, we conclude that < 

oo for all s. 

To prove that (ii) implies (iii) note that if ip is convex, then 

ifix) - R^ifix) = supi ip{x) - ip{y)dp{y) - XHp{x - / ydp{y))} 
p Jr Jr 

< sup{ / ip'{x){x - y)dp{y) - XH*p{x - / ydp{y))} 
p Jr Jr 

<swp{ip'ix) ■ {x- / ydp{y)) - XHp{x - / ydp{y))}. 
p Jr Jr 
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We have uv < CHp{u) + XHp{v) for some constant C depending only on A and (3. 
Therefore 

f{x) — R^ipix) < CHi3{ip'{x)) 

and hence (hi) holds. 

To see that (hi) implies (iv) (with B = C) we follow the reasoning from the 
proof of |131 Theorem 1.11]. By a perturbation argument we can assume that 
/r is absolutely continuous. Indeed, if 7 is a Gaussian measure on R" with the 
covariance matrix being a sufficiently small multiple of identity, then the product 
measure on R" x R" satisfies the modified log-Sobolev inequality with constant 
C. Let (fi : R" —>■ R" be a smooth convex Lipschitz function and let $ : M" x 
R" —>• R" be defined by the formula $(a;,y) = (p{x + ey), x,y £ R. Applying the 
modified log-Sobolev inequality to the measure /i( 8)7 and the function $, we see that 
the convolution y * 7 ^, where 7 £(A) = 7 (A/e), satisfies the modified log-Sobolev 
inequality on R" with constant Ce —>■ C as e —?> 0. If one can prove that (1531) is 
satisfied by /i * 7 ^ with = C^, then one can obtain it for y with B = C hy the 
Lebesgue dominated convergence theorem. Note that if y is absolutely continuous, 
then another standard approximation shows that (iii) holds for all convex Lipschitz 
functions (by the Rademacher theorem the gradient is then almost surely well 
defined). 

If (/?: R —>■ R is a convex function, then so is Qt<f and it satisfies the Hamilton- 
Jacobi equation 

dtQty} + Hp{dj,Qty) = 0 (3.6) 

(see [B] Chapter 3.3.2]). More precisely, the function {t,x) ^ Qtip{x) is Lipschitz 
and the equation is satisfied a.e. with respect to the Lebesgue measure on (0, 00 ) x 
R". We fix to G [0, C], define 

k{t) = {1 + C ^(t — to))l{t<to} + (1 + C' ^/3 ^(t — to))^l{t>to}) 

and set F{t) = for t > 0. Using the absolute continuity of y 

together with integrability properties of Lipschitz functions, one can show that F 
is locally Lipschitz, F'{t) exists for almost all t > 0, and 

k{t)F'{t) = k{t) f e''^*^'^<^‘^^'"\k'(t)Qt(p{x) + k{t)dtQt(p{x))dy{x) 

Jr 

= k{t) [ {t)Qtip{x) - k{t)Hp{dj;Qt(p{x)))dy{x) 

Jr 

= k'{t)F{t) log F{t) + k'{t) Ent 

-k\t) [ e'^^*^^^‘^^^'>Hp{d,QMx))dy{x) 

Jr 

< k\t)F{t)\ogF{t)+Ck\t) [ e'^^^^^*‘^^^^Hy{k{t)d^Qt(p{x))dy{x) 

Jr 

- k'^{t) ( Hp{da:Qt(pix))dy{x) 

Jr 

<k'it)Fit)logFit) 

+ [ ■ (C'fc'(t)max{|fc(f)p, |fc(t)|^} - fc^(t)) 

Jr 

= k'(t)Fit) log Fit). 

where we used dim . the inequality (EH), the fact that 

Fdpiax) < max{|ap, \a\^^~^^'’/^}F[pix), 
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and the definition of k(t) in the last equality. The above differential inequality is 
equivalent to (log(F(t))/fc(t))' < 0 for almost all t > 0. Since Qt<f < (fi, we arrive 
at 


< faint 

k(t) - s^ 0 + k(s) 


< lim 

s—>0+ 


log(4e^(-)^Wd/i(rE)) 

k(s) 


log||e^(^)||fe(o), 


which is equivalent to the assertion. 

Finally, (iv) implies (i) (with b = BVl). Indeed, we plug to = B and t = BAl < 1 
(so that k{t) = [By 1)“^ and k{0) = 0) into inequality (Id.dll , and using the fact 
that Qt<p decreases with t we arrive at a dual formulation of the transport cost 
inequality (see Remark Id.511 . This ends the proof of the last implication. □ 


The proof in the case /3 = 0 follows roughly the sames lines. Below we sum up 
the more important changes one has to make. 

Sketch of the proof of Proposition \S.6[ For simplicity we assume n = 1. To prove 
that (i) implies the entropy bound of (ii) (with <52 = ^i) we follow exactly the 
same reasoning as above, in the proof of Proposition Id.II As for the exponential 
integrability of , we again consider (p[x) = r\x\ and note that by the dual 

formulation of (i), Ee^ ^ But, as one can easily see, the boundedness 

of the derivative of implies that for r sufficiently large Qi^SiP{x) = infj,{r|y| + 

Hq j^[x — y)} — Hq (x), which shows that Ee^ ^ 0 , 6 ^ (^) 

To prove that (ii) implies (iii) with 5^ = S 2 [X A [2D)~^) and C = XD, notice 
that if ip is convex and \ip'\ < da, then ¥jip{X)e‘^^^'^ < oo and 

ip{x) - RsM^) ^ sup{(p'(x) -{x- ydp[y)) - XH^g^ {x- ydpiy))}. 

p Jr Jr 

We have 


uv < {XHlgJ*{u) + XH^g^iv) = XHo,sAu/X) + XH^g^iv) 

= XHo,xs,{u) + XHlg^[v) < XHo,s,{u) + XH*^g^[v). 

Therefore ip{x) — Rg^ip[x) < XHo^s^ip'{x}) and hence (iii) holds. 

As in the case ,5 > 0, for the proof that (iii) implies (iv) (with 5^ = da and 
B = C y 1) we can assume that y is absolutely continuous. Thus, by standard 
approximation arguments, the log-Sobolev inequality of (iii) is satisfied for all con¬ 
vex Lipschitz functions (the gradient is then almost surely well defined). In what 
follows, without loss of generality, we will also assume that C > 1. 

Note that \dxtH^g^[[x - y)/t)\ = |(FIo_ 53 )'((x - y)/t)\ < da, so for t > 0, x H> 
Qt,5^ip[x) is da-Lipschitz. An adaptation of the reasoning from [SJ Chapter d.d.2] 
shows that (t, x) i—?> Qt^s^pix) is Lipschitz and 

P [dxQtjS^p') —0 

Lebesgue a.e. on (0, oo) x R (we remark that for n > 1 one needs to work with the 
rather than with the Euclidean norm to get that 11 —7> Qt^s^ip[x) is Lipschitz). 
Next, one shows that for any C >C, the function F[t) = 
t > 0, is well defined, where we have denoted 

k(t) = C~^ min{(l -|- C~^(t - to)), I}, 

and to € [0, C] is fixed number. Indeed, 

k{t)Qt,53P>{x) < k[t)tHo g^[x/t) + k[t)ip[Q) < fc(t)d3|x| -I- k[t)ip[t)). 
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For \x\ > 2/^3 the right-hand side above is bounded by C~^{Hq g^{x) -|- 1 -|- i^(0)), 
which together with the assumed exponential integrability of shows that the 
integral defining F is finite. 

Using absolute continuity of F and again exponential integrability of Fl^ one 
proves that for C > C, F is differentiable a.e. Moreover, the same argument as 
for /? > 0 shows that F satisfies k{t)F'{t) < F{t)F(t) log F{t) for almost all t > 0. 
Let us list the additional observations needed to carry out the calculations. First, 
\dxk{t)Qt, 53 (p\ < \dxQt,S 3 '^\ < ^ 3 , which allows us to use the modified log-Sobolev 
inequality. Second, we have F[o^s3{k{t)dxQt,S3^{x)) = k(t)'^F[o^s3idxQt,S3^ix)). Fi¬ 
nally Ck'{t)k{t)‘^ < k{t)'^. We remark that to verify the above properties one uses 
the assumption that C > 1. 

Integration of the differential inequality and taking the limit C —>■ (7 ends the 
proof of the implication. 

Finally, (iv) implies (i) (with b = B and (5i = (54) as in the case /3 > 0: it is 
enough to check that 


( f "Qi.s3v>i^)d^{x))^e- 

Jr 


< 1 


holds for all convex Lipschitz functions R —>■ K bounded from below (see Re¬ 
mark ESI). This is easy to recover from the hypercontractive inequality in (iv). If 
B < 1 one sets to = t = B and uses the inequality Qi,S 3 ^ < Qt,S 3 'f, otherwise one 
sets < = to = 1- n 

Remark 3.8. As mentioned in the Introduction, very recently Feldheim et al. [7] 
proved that in the case of symmetric measures on R, the condition /i G A4o{m,a) 
for some m, tr is equivalent to the infimum convolution inequality 


cQi.svix 


^dfj-ix) f e '^^^^dfi{x) < I 

Jr 


for some 6, which is a formally stronger property than the inequality (i) of Propo¬ 
sition EiH (see Remark Em. Thus for = 0 the modified log-Sobolev inequality is 
in fact equivalent to a stronger concentration property. Gozlan et al. in m ex¬ 
tended the characterization of [7] to much more general transport costs, including 
all the cases considered by us. It would be interesting to provide an intrinsic char¬ 
acterization also for measures on the line satisfying general modified log-Sobolev 
inequalities. One expects that for /3 G (0,1] such measures form a strictly larger 
class than the class of measures satisfying the inequality of m- 


4. A REMARK ABOUT THE NON-CONVEX SETTING 

As we noted in Remark Em a Borel probability measures satisfying inequality 
(Em for all smooth convex functions (with bounded derivatives), satisfies also the 
Poincare inequality for all smooth convex functions. It follows from [3 Theorem 1.4] 
that in this case fi G Mo(rn, a) for some m and a. 

We will now show that sufficiently nice Borel probability measures /i which sat¬ 
isfy the modified logarithmic Sobolev inequality (Em for all (not necessarily log- 
convex) smooth functions belong to the class A4/3(m, (T^’*'^) for some m and a. 
More precisely, for /3 > 0 we have the following result. 

Proposition 4.1. Le fi be a Borel probability measure on R, absolutely continuous 
with respect to the Lebesgue measure. Suppose there exist constants C < oo and 
P G (0,1] such that for all smooth functions 

Ent/(X)2 < CE(/'(A)2 V \f{X)\^), 
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where X is a random variable with law /i. Denote by M the median and by n the 
density of fj,, and assume also that 

-| pxV M 

—n{x)~^ > £ n{t)~^dt 

P JxAM 

for some e > 0 and all x ^ M. Then there exist constants m,a < oo, such that 
G Mp{m, 


Proof. It follows from the Barthe-Roberto criterion (see [5J Theorem 10]) that for 
some constant K and all y > M, 


^([?/,oo))log- 


1 




1//3 


< K. 


T{[y,oo))\jM 

Holder’s inequality (for exponents /3 + 1 and (/3 + l)//3) gives us 

.x+l/x!^ l/(;3+l) i-x+l/xP 

<L Ux ^ 


(4.1) 


/3/(/3+l) 


and therefore 


pl+/3 


< 


o \ 1/-S 

n{t)~^dtj yL{[x,x + \/x^)). 


Since /x has the concentration property of order 1 + /3, using (SH) ioT y = x + l/x^ 
we get that 

n / nx+l/x^ \ -1//3 

—x^^^yL{[x + l/x^,OG))<yj n(t)~^dtj < x^'^^^j,{[x,x + l/x^)) 

for some constant D and x large enough. Hence, for x large enough, 

1 


^([a; + l/x^,oo)) < 


:fi{[x + 1/x^, oo)). 


1 + D/K 

Since we can deal analogously with the left tail, the claim follows from Proposi¬ 
tion 11.21 □ 


Remark 4.2. As proved in [T], in the case of /3 = 1 we do not need the assumption 
of absolute continuity, as in this case the Barthe-Roberto criterion can be replaced 
by a more general Bobkov-Gotze criterion [4], in which one does not require the 
existence of a density. 
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